LPC-based inversion of the DRM articulatory model
نویسنده
چکیده
Articulatory representations are expected to bring better speech recognition results. This requires to estimate the parameters of a speech production model from the speech sound, problem known as acoustico-articulatory inversion. Known methods to solve this problem usually introduce a heavy computational cost. Alternately, it is known that Linear Prediction analysis offers an analogy with acoustic filtering. This analogy had been exploited to develop a less expensive analytic method applicable to the estimation of tube shapes discretized in equallength sections. We have extended the method to the DRM case, where the tube is made of unequal-length sections. The proposed DRM inversion scheme is thus simpler and faster. Furthermore, it shows good performance in terms of low residual modeling error. It also enhances speech recognition results when used to compute Log Area Ratios.
منابع مشابه
Lpc Modeling with Speech Production Constraints
Despite the approximations it supposes, performing LPC-based acoustico-articulatory inversion is justified in some applicative frameworks. By illustrating this assertion through experiments aiming at incorporating speech production constraints from the DRM model and from a factor-based model into an LPC modeling scheme, we promote the use of LPC-based inversion as an interface between Productio...
متن کاملAcoustic-to-articulatory inversion using a speaker-normalized HMM-based speech production model
Acoustic-to-articulatory inverse mapping is a difficult problem because of its non-linear and oneto-many characteristics. We have previously developed a speech inversion method using a hidden Markov model (HMM)-based speech production model which takes into account the phonemespecific dynamic constraints of articulatory parameters. We found that the constraint significantly decreases the estima...
متن کاملRelating LPC modeling to a factor-based articulatory model
This paper proposes a method for recovering the articulatory parameters of a factor-based vocal tract shape model from the speech waveform. This is realized by analytically relating the shape model to a Linear Prediction lattice lter. Results pertaining to human vowels are presented. They show a good agreement with phonetic characteristics in a real-time computational framework.
متن کاملUnsupervised vocal-tract length estimation through model-based acoustic-to-articulatory inversion
Knowledge of vocal-tract (VT) length is a logical prerequisite for acoustic-to-articulatory inversion. Prior work has treated VT length estimation (VTLE) and inversion largely as separate problems. We describe a new algorithm for VTLE based on acoustic-to-articulatory inversion. Our inversion process uses the Maeda model (MM, [1,2]) and combines global search [3] and dynamic programming for tra...
متن کاملSparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: benefit to speech recognition
Speech recognition using articulatory features estimated using Acoustic-to-Articulatory Inversion (AAI) is considered. A recently proposed sparse smoothing approach is used to postprocess the estimates from Gaussian Mixture Model (GMM) based AAI using MinimumMean Squared Error (MMSE) criterion. It is well known that low-pass smoothing as post-processing improves the AAI performance. Sparse smoo...
متن کامل